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Specification 
Title of the Invention 
Internet Home Page Data Acquisition Method 



5 Background of the Invention 

The present invention relates to an Internet 
home page data acquisition method of automatically 
acquiring browsed Internet home page data, i.e., the 
hypertext data of WWW (World Wide Web) home pages and, 

10 more particularly, to an Internet home page acquisition 
method which can actually shorten the line connection 
time in a communication path that occupies a telephone 
or portable phone line. 

Conventionally, acquisition of all Internet 

15 home page data has been managed on the client side. 

Cache management has been performed on the server side 
as well as on the client side. More specifically, home 
page data accessed in the past are stored in a cache on 
the server side to reduce the actual amount of access to 

20 the WWW server. 

As an example of this technique, a method is 
disclosed in "Data Transfer System" in Japanese Patent 
Laid-Open No. 10-21174 (reference 1), which is not 
limited to a WWW server and designed to suppress an 

25 increase in communication time when a client on a LAN 
(Local Area Network) requests data stored in a data 
server connected to a WAN (Wide Area Network) and the 



requested data is transferred. According to reference 1, 
in response to a data request from the client to the 
data server, the cache server transfers the requested 
data to the client if the data is stored in the cache 
5 server. If the data is not stored in the cache server, 
a data request is sent to the data server, and the data 
is transferred to the client through the cache server. 

In an environment in which a user browses data 
in a WWW server by tracing URLs (Uniform Resource 

10 Locators) one by one as in Internet surfing, the method 
disclosed in "Internet Home Page Management System", 
Japanese Patent Laid-Open No. 10-240604 (reference 2) 
has been proposed. According to reference 2, in order 
to reduce the amount of access to the network, the WWW 

15 server transmits only a changed portion of home page 
data in the WWW server. 

As disclosed in "Automatic Hypertext 
Acquisition Apparatus", Japanese Patent Laid-Open 
No. 10-207759 (reference 3), a method has been proposed, 

20 by which a user can efficiently and automatically 
acquire effective pages in a small cache area in a 
portable terminal or the like. In the method disclosed 
in reference 3, hypertext data are analyzed to trace 
links to the original hypertext data in accordance with 

25 the link tags of the hypertext data so as to store, in a 
cache, files that are likely to be downloaded, before 
the user traces the links, thereby shortening the time 



required for connection to the network. 

The following problem is, however, posed in 
the conventional method of managing acquisition of all 
Internet home page data on the client side. 
5 A client uses a method of connecting to URL 

addresses one by one and accepting data. For this 
reason, if a WWW server in which a target URL exists is 
congested, the client has difficulty in accessing the 
WWW server. In this case, the client keeps waiting for 

10 acceptance of home page data until a timeout occurs. If, 
therefore, the number of congested WWW servers increases, 
much time is required to sequentially download the home 
page data of all necessary URL addresses. As a 
consequence, if connection is made through a telephone 

15 line or the like, the occupancy of the line undesirably 
increases . 

A similar problem arises in the automatic 
hypertext acquisition apparatus disclosed in reference 3. 
According to the method disclosed in reference 3, the 

20 occupancy of a line can be decreased as compared with 

the method of making a user obtain target information by 
tracing hypertext links. If, however, files are 
downloaded by the method in reference 3, even files that 
are not necessary for the user are automatically 

25 downloaded into the client, resulting in an increase in 
traffic on the network. For this reason, the load on 
the server increases because of the processing performed 
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to shorten the line connection time. This delays a 
response from the server. As a consequence, the time 
taken for connection to the network is prolonged. 
Furthermore, since the function of 
5 transmitting only updated portion of home page data from 
the WWW server side is lost, the connection time cannot 
be shortened. 
Summary of the Invention 

It is an object of the present invention to 
10 provided an Internet home page data acquisition method 
which can shorten the unnecessary wait time in 
connecting to a WWW server. 

In order to achieve the above object, 
according to the present invention, there is provided an 
15 Internet home page data acquisition method comprising 
the steps of transferring an access list held by a 
client to a cache URL address storage server, acquiring 
home page data by a cache server on the basis of the 
transferred access list, and transferring the acquired 
20 home page data from the cache server to the client upon 
completion of acquisition of the home page data. 
Brief Description of the Drawings 

Fig. 1 is a view showing the arrangement of a 
network system to which an Internet home page data 
25 acquisition method according to the first embodiment of 
the present invention is applied; 

Fig. 2 is a view showing the arrangement of a 



network system to which an Internet home page data 
acquisition method according to the second embodiment of 
the present invention is applied; 

Fig. 3 is a block diagram showing the internal 
5 arrangement of a cache server in Fig. 2; 

Fig. 4 is a block diagram showing the internal 
arrangement of a cache URL address storage server in 
Fig. 2; 

Fig. 5 is a view showing a folder structure 
10 for managing an access list on the client side; 

Fig. 6 is a view showing management 
information on the cache URL address storage server 
side ; 

Fig. 7 is a flow chart for explaining the 
15 operation of the network system in Fig. 2; 

Fig. 8 is a view showing an Internet window to 
explain how URL addresses are set from a WWW home page 
according to the third embodiment of the present 
invention; and 

20 Fig. 9 is a view showing the arrangement of a 

network system to which an Internet home page 
acquisition method according to the fourth embodiment of 
the present invention is applied. 
Description of the Preferred Embodiments 

25 The present invention will be described in 

detail below with reference to the accompanying drawings. 
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[First Embodiment] 

Fig. 1 shows the arrangement of a network 
system to which an Internet home page data acquisition 
method according to the present invention is applied, 
5 and more specifically, an example of a cyclic data 
acquisition system. 

The network system of this embodiment includes 
a client 101 as a wire client and a client 102 as a 
wireless client, which browse home page data, a public 

10 network 104 to which the client 101 is connected, a base 
station 103 for performing radio communication with the 
client 102, a wireless network 105 to which the base 
station 103 is connected, an access server 107 for 
connecting only registered users to an Internet network 

15 111 through the public network 104, and an access server 
106 for connecting only registered users to the Internet 
network 111 through the wireless network 105. 

This network system also includes a cache URL 
address storage server 109 for storing and managing the 

20 URL addresses (cache URL addresses) of home pages for 

which cache requests are received from the client 102, a 
cache server 110 which is connected to the Internet 
network 111 to retain retrieved home pages, a local 
network 108 for networking the access servers 107 and 

25 106, cache server 110, and cache URL address storage 
server 109, and WWW servers 112, 113, and 114 for 
storing home page data and distributing the hypertext 



contents of the home pages worldwide through the 
Internet network 111. 

Fig. 1 shows only the three WWW servers 112, 
113, and 114 for the sake of descriptive convenience, 
5 even through countless WWW serves are connected to the 
Internet network 111. 

In this embodiment, to retain desired home 
page data in the cache server 110, first of all, the 
users of the clients 101 and 102 create access lists 
10 (cyclic lists) of the URL addresses of home pages which 
the users want to browse. 

These access lists are respectively retained 
in URL list retention folders in the clients 101 and 102. 
The respective URL list retention folders have their own 
15 attributes. Of these folders, one folder in the client 

101 and one folder in the client 102 are defined as 
server storage cyclic folders 101c and 102c, 
respectively. These server storage cyclic folders 101c 
and 102c are used to retain access lists (cyclic lists) 

20 of URL addresses from which the users want to 
collectively cache data. 

Browsers 101a and 102a in the clients 101 and 

102 create list data conforming to the format of the 
cache URL address storage server 109 on the basis of the 

25 access lists retained in the server storage cyclic 

folders 101c and 102c. The browsers 101a and 102a of 
the clients 101 and 102 transfer the created access 
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lists of cache URL addresses to the cache URL address 
storage server 109 at the time of line connection, and 
then disconnect the lines. 

The cache server 110 searches for home pages 
5 on the basis of the access lists transferred to the 

cache URL address storage server 109, and sequentially 
retains home page data upon finding the target URL 
addresses. When all the home page data are retained, 
the cache server 110 connects lines to the clients 101 

10 and 102 and issues data acceptance requests to the 
clients 101 and 102. Upon reception of the data 
acceptance requests, the clients 101 and 102 accept the 
home page data from the cache server 110 and retain them 
in caches 101b and 102b. 

15 According to another method, the clients 101 

and 102 periodically check the retained state of cache 
data in the cache server 110, and accept only cached 
data. In this case, on the basis of periods of time 
required for the acquisition of home page data and the 

20 like, the cache server 110 notifies the clients 101 and 
102 of the times at which data should be accepted. This 
makes it possible to avoid an unnecessary increase in 
traffic on the network. 

The cache server 110 also caches home page 

25 data in accordance with requests from the clients 101 
and 102. If, therefore, the cache server 110 retains 
all the home page data requested by the clients 101 and 



102 for the same period of time, the memory of the cache 
server 110 will be wasted. 

In order to solve this problem, different data 
retention times are set in accordance with the numbers 
5 of accesses to the respective home pages. In this case, 
the data of home pages to which larger numbers of 
accesses are made are retained for longer times. With 
regard to home pages which are accessed little, the data 
are discarded when the clients 101 and 102 accept them. 

10 This makes it possible to further reduce the traffic on 
the network. 

According to this embodiment, the clients 101 
and 102 send access lists of URL addresses of home pages, 
from which the clients want to accept data, to the cache 

15 URL address storage server 109, and the home page data 
are acquired by the cache server 110, which is always 
connected to the network, and transferred to the clients 
101 and 102. With this operation, the home page data 
can be acquired in short connection times. 

20 In addition, it suffices if access lists 

(cyclic lists) of URL addresses to be circulated are 
stored in folders, in the clients 101 and 102, to which 
a special meaning like "dedicated to circulation" is 
assigned. The access lists in the folders are 

25 transferred to the cache URL address storage server 109 
at the time of line connection. The cache server 110 
then acquires home page data on the basis of the 
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transferred access lists. This makes it possible to 
shorten the wait time in home page address search 
operation and the line connection time. 
[Second Embodiment] 
5 Referring to Fig. 2, reference numeral 201 

denotes a client including a browser 201a for browsing 
home page data and a cache 201b; 202, a local network to 
which the client 201 is connected; 203, a cache URL 
address storage server connected to the local network 
10 202; 204, a cache server connected to the local network 
202; 205, an Internet network to which the cache server 
204 is connected; and 206, a WWW server connected to the 
Internet network 205. 

The cache URL address storage server 203 
15 causes the cache server 204 to acquire home page data 
from the WWW server 206 on the basis of an access list 
(cyclic list) of cache URL addresses transferred from 
the client 201. 

As shown in Fig. 3, the cache server 204 
20 includes a target address access rate detecting section 
204a for checking the access state of URL addresses to 
be circulated, an elapsed time measuring section 204b 
for measuring the retention time of cache data, a home 
page data cache section 204c for storing acquired home 
25 page data, and a cache comparing section 204d for making 
a comparison to check whether the cached data is the 
latest data. 



As shown in Fig. 4, the cache URL address 
storage server 203 includes a cyclic URL address 
accepting unit 203a for accepting cache URL addresses 
(cyclic URL addresses) requested by a client, a cyclic 
5 URL address list retaining section 203b for retaining an 
accepted access list (cyclic URL address list) of cache 
URL addresses for each client, and a client ID managing 
section 203c for checking the ID of each client, 
managing information for ensuring an area for caching 
10 data for each client, and managing a data acceptance 
requesting method for each client. 

An access list of cache URL addresses 
transferred from the client 201 to the cache URL address 
storage server 203 is managed in units of folders. 
15 Fig. 5 shows the folder structure of an access list. 

As shown in Fig. 5, the access list of URL 
addresses is managed by dividing a folder into a server 
storage cyclic folder HI and client storage cyclic 
folder H2 . The access list stored in the server storage 
20 cyclic folder HI is transferred as an access list 

(cyclic list) of cache URL addresses to the cache URL 
address storage server 203. For the sake of descriptive 
convenience, two folders are used in this case, although 
the number of folders is not limited. 
25 When the cache server 204 is to be caused to 

actually perform circulation, the address information of 
the corresponding URL addresses is copied to the server 



storage cyclic folder HI. When the client 201 is to be 
caused to actually perform circulation, the address 
information of the corresponding URL addresses is copied 
to the client storage cyclic folder H2 . 
5 At the time of line connection, the client 201 

transfers the address information of "URL11", "URL12", 
and "URL13" stored in the server storage cyclic folder 
HI to the cache URL address storage server 203. Upon 
reception of this information, the cache URL address 

10 storage server 203 prepares for the acquisition of 
information . 

Fig. 6 shows management information (access 
list) in the cache URL address storage server 203. The 
cache URL address storage server 203 manages a cache 

15 state for each client. 

Referring to Fig. 2, the Internet network 205 
and WWW server 206 are known well by the person skilled 
in the art, and hence a description of the detailed 
arrangements will be omitted. 

20 The operation of the above network system will 

be described next with reference to the flow chart of 
Fig. 7. 

At the time of line connection (step S701) , 
the client 201 connects to the cache URL address storage 
25 server 203 (step S702) . After connection to the cache 
URL address storage server 203, the client 201 extracts 
cache URL addresses to be circulated from the server 



storage cyclic folder HI, and transfers them to the 
cache URL address storage server 203 (step S703) . 

Upon completion of the transfer of the URL 
addresses, the client 201 temporarily disconnects from 
5 the cache URL address storage server 203 (step S704), 
and disconnects the line (step S705) . The cache server 

204 acquires and caches home page data from home pages 
at the target URL addresses through the Internet network 

205 on the basis of the access list of cache URL 

10 addresses stored in the cache URL address storage server 
203. 

Obviously, data having the same date and size 
as those of data existing in the cache server 204 are 
regarded as identical data and hence are not acquired 
15 from the WWW server 206, and only new data are stored. 

If, however, all the data of accessed home 
pages are stored, a shortage of memory capacity occurs. 
For this reason, the retention time of cache data is 
changed in accordance with the access rate of each home 
20 page. 

The retention period of the data of a home 
page with a high access rate is set be longer than that 
of the data of a home page with a low access rate. This 
makes it possible to efficiently cache data. Note that 
25 the access rate of each home page may be calculated on 
the basis of the frequency of occurrence of URLs with 
which the client 201 requests the acquisition of data. 
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According to this embodiment, the cache server 
204 that is always connected to the Internet has the 
data acquisition function, and the client 201 requests 
the cache server 204 through the cache URL address 
storage server 203 to acquire home page data. When all 
home page data are acquired, the client 201 accepts the 
data from the cache server 204. With this operation, 
the wait time in accessing the WWW server can be 
shortened, thus shortening the connection time of a 
telephone line or the like. 

In addition, a folder in the client 201 is 
assigned a special meaning like "dedicated to 
circulation", and operation is performed according to 
the meaning assigned to the folder at the time of line 
connection, thereby easily setting URL addresses to be 
circulated . 
[Third Embodiment] 

In the third embodiment, a further improvement 
is made on the method of transferring URL addresses to a 
cache URL address storage server 203. More specifically, 
as shown in Fig. 8, the message "Please input the 
addresses of home pages that you want to circulate" is 
displayed on an Internet window on a client 201 to allow 
the user to set URL addresses on a WWW home page. Note 
that the basic arrangement of the third embodiment is 
the same as that of the second embodiment, and hence a 
description thereof will be omitted. 



According to this embodiment, by setting URLs 
on the home page, the user can set URL addresses by 
using a general WWW browser without requiring dedicated 
software for sending cache URL addresses to the cache 
5 URL address storage server 203. Note that the method of 
accepting stored URL data is the same as that used in 
the second embodiment. 
[Fourth Embodiment] 

Fig. 9 shows still another embodiment 
10 associated with the connection state of clients and 

cache servers. As shown in Fig. 9, a public network 904 
and wireless network 905 respectively have cache servers 
909 and 913 separately connected to an Internet network 
914. This arrangement makes it possible to transfer 
15 data to clients 901 and 902 by methods suitable for the 
respective networks. 

In some case, for example, the client 902 
designed to perform radio communication may have 
difficulty in accepting data depending on the state of a 
20 radio channel between the client and a disk unit 903. 
For this reason, data reduction and the like are 
executed in the cache server 913. This allows the 
client 902 to accept data in a shorter period of time 
than in accessing through the public network 904. In 
25 this manner, a data transfer method can be optimized in 
accordance with a network environment. 

Reference numerals 901a and 902a denote 



nm 



browsers; 901b and 902b, caches; 906 and 910, access 
servers; 907 and 911, local networks; 908 and 912, cache 
URL address storage servers; and 915, 916, and 917, WWW 
servers . 

5 As has been described above, according to the 

present invention, since a cache server that is always 
connected to the network is made to acquire home page 
data, the home page data can be acquired in a short 
connection time. This can shorten the unnecessary wait 

10 time when a client connects to a WWW server. 

In addition, since an access list to be 
transferred to a cache URL address storage server is 
retained in a server storage cyclic folder in a client, 
the data to be transferred to the cache URL address 

15 storage server are specified. This makes it possible to 
facilitate circulation. 
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What is claimed is: 



1. An Internet home page data acquisition 

2 method comprising the steps of: 

3 transferring an access list held by a client 

4 to a cache URL address storage server; 

5 acquiring home page data by a cache server on 

6 the basis of the transferred access list; and 

7 transferring the acquired home page data from 

8 said cache server to said client upon completion of 

9 acquisition of the home page data. 

2. A method according to claim 1, wherein 

2 said client includes a server storage cyclic 

3 folder for storing an access list to be transferred to 

4 said cache URL address storage server, and 

5 the step of transferring the access list 

6 comprises transferring the access list retained in said 

7 server storage cyclic folder to said cache URL address 

8 storage server. 

3. A method according to claim 1, wherein the 

2 step of transferring the home page data comprises the 

3 steps of: 

4 causing said cache server to connect a line to 

5 said client when acquisition of home page data is 

6 completed; and 
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7 transferring the acquired home page data from 

8 said cache server to said client after the line is 

9 connected. 



4. A method according to claim 1, wherein the 

2 step of transferring the home page data comprises the 

3 steps of: 

4 causing said client to connect a line to said 

5 cache server when acquisition of home page data is 

6 completed; and 

7 transferring the acquired home page data from 

8 said cache server to said client after the line is 

9 connected. 

5. A method according to claim 4, wherein the 

2 step of connecting the line comprises the step of 

3 periodically connecting the line to said cache server. 

6. A method according to claim 4, wherein 

2 the method further comprises the step of 

3 causing said cache server to notify said client of an 

4 acquisition end time when the access list is transferred, 

5 and 

6 the step of connecting the line comprises the 

7 step of connecting the line to said cache server at the 

8 notified acquisition end time. 
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Abstract of the Disclosure 



In an Internet home page data acquisition 
method, an access list held by a client is transferred 
to a cache URL address storage server. Home page data 
is acquired by a cache server on the basis of the 
5 transferred access list. The acquired home page data is 
transferred from the cache server to the client upon 
completion of acquisition of the home page data. 
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which a patent is sought on the invention entitled 

Internet Home Page Data Acquisition Method 

the specification of which 

(check one) 

f§ is attached hereto. 

H was filed on as United States Application No. or PCT International 

% Application Number 

M and was amended on 

in (if applicable) 

^hereby state that I have reviewed and understand the contents of the above identified specification, 
including the claims, as amended by any amendment referred to above. 

^knowledge the duty to disclose to the United States Patent and Trademark Office all information 
ffiown to me to be material to patentability as defined in Title 37, Code of Federal Regulations, 
Section 1 .56. 

I hereby claim foreign priority benefits under Title 35, United States Code, Section 1 1 9(a)-(d) or 
Section 365(b) of any foreign application(s) for patent or inventor's certificate, or Section 365(a) of 
any PCT International application which designated at least one country other than the United States, 
listed below and have also identified below, by checking the box, any foreign application for patent or 
inventor's certificate or PCT International application having a filing date before that of the application 
on which priority is claimed. 

Prior Foreign Application(s) Priority Not Claimed 



124132/1999 Japan 30/04/1999 □ 

(Number) (Country) (Day/Month/Year Filed) 
□ 

□ 



(Number) (Country) (Day/Month/Year Filed) 



(Number) (Country) (Day/Month/Year Filed) 
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I hereby claim the benefit under 35 U.S.C. Section 119(e) of any United States provisional 
application(s) listed below: 



(Application Serial No.) 


(Filing Date) 


(Application Serial No.) 


(Filing Date) 


(Application Serial No.) 


(Filing Date) 



I hereby claim the benefit under 35 U. S. C. Section 120 of any United States application (s), or 
Section 365(c) of any PCT International application designating the United States, listed below and, 
insofar as the subject matter of each of the claims of this application is not disclosed in the prior 
United States or PCT International application in the manner provided by the first paragraph of 35 
U.S.C. Section 112, I acknowledge the duty to disclose to the United States Patent and Trademark 
Office all information known to me to be material to patentability as defined in Title 37, C. F. R., 
Section 1 .56 which became available between the filing date of the prior application and the national 
® PCT International filing date of this application: 





(Application Serial No.) 


(Filing Date) 


(Status) 








(patented, pending, abandoned) 




(Application Serial No.) 


(Filing Date) 


(Status) 








(patented, pending, abandoned) 




(Application Serial No.) 


(Filing Date) 


(Status) 








(patented, pending, abandoned) 



I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these statements 
were made with the knowledge -that willful false statements and the like so made are punishable by 
fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that such 
willful false statements may jeopardize the validity of the application or any patent issued thereon. 



Form PTO-SB-01 (6-95) (Modified) 



Patent and Trademark Office-U.S. DEPARTMENT OF COMMERCE 



Page 3 of 3 



POWER OF ATTORNEY: As a named inventor, I hereby appoint the following attorney(s) and/or 
agent(s) to prosecute this application and transact all business in the Patent and Trademark Office 
connected therewith, (list name and registration number) 

Stephen D. Murphy, Reg. No.: 22,002 Paul J. Esatto, Reg. No.: 30,749 

William C. Roch, Reg. No.: 24,972 Mark J. Cohen, Reg. No.: 32,211 

Frank S. DiGiglio, Reg. No.: 31,346 Donald T. Black, Reg. No.: 27,999 



Send Correspondence to: 



Paul J. Esatto, Jr. 
Scully, Scott, Murphy & Presser 
400 Garden City Plaza 
Garden City, New York 11530 



(greet Telephone Calls to: (name and telephone number) 
Paul J. Esatto, Jr. (516) 742-4343 



AJFull name of sole or first inventor 

□ Mitsuhiro Watanabe 



UlSole or first inventor's signature ^ , /) _ , ft ■" ' *\ Date Apr.l7, 
'~/Kb£tedu/lr /T&^u£^w'y 2000 



Tokyo, Japan 



Japan 



p-iPost Office Address 

c/o NEC Corporation, 7-1, Shiba 5-chome, Minato-ku, 



Tokyo, Japan 



Full name of second inventor, if any 



Second inventor's signature 



Post Office Address 
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